A comparison of Thai speech recognition systems using hidden Markov model, neural network, and fuzzy-neural network

نویسندگان

  • Visarut Ahkuputra
  • Somchai Jitapunkul
  • Nutthacha Jittiwarangkul
  • Ekkarit Maneenoi
  • Sawit Kasuriya
چکیده

The recognition of ten Thai isolated numerals from zero to nine and 60 Thai polysyllabic words are compared between different recognition techniques, namely, Neural Network, Modified Rackpropagation Neural Network. Fuzzy-Neural Network, and Hidden Markov Model. The I j-state left-to-right discrete hidden markov model in cooperation with the vector quantization technique has been studied and compared with the multilayer perceptron neural network using the error backpropagation. the modified backpropagation. and also with the fuzzy-neural network with the same configuration. The recognition error on Thai isolated numerals using the conventional neural network. the modified neural network. the fuzzy-neural network. and the hidden markov model techniques are 26.97 percent. 22.00 percent. 8.50 percent, and 15.75 percent respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

Thai Word Recognition Using Hybrid MLP-HMM

The Hidden Markov Model (HMM) is a popular model for speech recognition systems. However, one of the difficulties in applying HMM is the estimation of the emission probabilities for constructing the Gaussian Mixture Models (GMMs). In this paper, we propose a method to estimate the state emission probabilities in HMM framework using Artificial Neural Networks (ANNs), particularly the Multi-Layer...

متن کامل

Application of Fuzzy Clustering Neural Network in Conjunction Speech Recognition

In order to improve the approximation property of the past fuzzy clustering algorithms when identifying systems, a fuzzy clustering neural network (FCNN) is proposed and is applied to conjunction speech recognition system. Based on the fuzzy system model, FCNN presents every state as a fuzzy system and uses continuous frames as the system input. With improving fuzzy clustering identification al...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998